Emotional speech synthesis for emotionally-rich virtual worlds
Abstract
This paper gives a brief overview of the current state of the art in emotional speech synthesis, with a view to its use in a multi-modal context. After a brief introduction to the concept of text-to-speech synthesis, two approaches to the expression of emotions in speech synthesis are described. The categorical approach models emotions as discrete categories and can provide high-quality emotional speech for a few emotion categories; the dimensional approach uses emotion dimensions such as activation and evaluation to model essential emotional properties, leading to more flexible but less specific expressions. Architectural requirements for audio-visual integration are outlined. Three example demonstrators illustrate the types of applications we currently envisage. Finally, the question of how to validate a generation system is formulated, and a direction for developing possible answers is suggested.
Related resources
Trackside DEIRA: a dynamic engaging intelligent reporter agent
DEIRA is a virtual agent commenting on virtual horse races in real time. DEIRA analyses the state of the race, acts emotionally and comments about the situation in a believable and engaging way, using synthesized speech and facial expressions. In this paper we discuss the challenges, explain the computational models for the cognitive, emotional and communicative behavior, and account on impleme...
Automatic Recognition of Emotionally Coloured Speech
Emotion in speech is an issue that has been attracting the interest of the speech community for many years, both in the context of speech synthesis and in automatic speech recognition (ASR). In spite of the remarkable recent progress in Large Vocabulary Recognition (LVR), it is still far behind the ultimate goal of recognising free conversational speech uttered by any speaker in any envi...
Perception of emotional congruency in multimodal speech synthesis
This working paper experimentally investigates the perception of emotional congruency in multimodal speech synthesis. To this end, two perceptual experiments are described. Experiment 1 is a preliminary test exploring to what extent subjects are able to identify emotions in synthetic speech as well as in faces presented in short video clips. Results show that subjects find it easier to recognize emotions...
Verification of Acoustical Correlates of Emotional Speech using Formant-Synthesis
This paper explores the perceptual relevance of acoustical correlates of emotional speech by means of speech synthesis. In addition, the research aims at the development of »emotion rules« which enable an optimized speech synthesis system to generate emotional speech. Two investigations using this synthesizer are described: 1) the systematic variation of selected acoustical features to gain a prelim...
Verification of Acoustical Correlates of Emotional Speech using Formant-Synthesis
This paper explores the perceptual relevance of acoustical correlates of emotional speech by means of speech synthesis. In addition, the research aims at the development of »emotion rules« which enable an optimized speech synthesis system to generate emotional speech. Two investigations using this synthesizer are described: 1) the systematic variation of selected acoustical features to gain a pr...
Publication date: 2003